Optimal Dimensionality Reduction of Complex Dynamics: 
The Chess Game as Diffusion on a Free Energy Landscape. 
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Dimensionality reduction is ubiquitous in analysis of complex dynamics. The conventional dimen- 
sionality reduction techniques, however, focus on reproducing the underlying configuration space, 
rather than the dynamics itself. The constructed low-dimensional space does not provide complete 
and accurate description of the dynamics. Here I describe how to perform dimensionality reduction 
while preserving the essential properties of the dynamics. The approach is illustrated by analyzing 
the chess game - the archetype of complex dynamics. A variable that provides complete and accu- 
rate description of chess dynamics is constructed. Winning probability is predicted by describing 
the game as a random walk on the free energy landscape associated with the variable. The approach 
suggests a possible way of obtaining a simple yet accurate description of many important complex 
phenomena. The analysis of the chess game shows that the approach can quantitatively describe the 
dynamics of processes where human decision- making plays a central role, e.g., financial and social 
dynamics. 
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INTRODUCTION. 

Complex processes are often described by a single vari- 
able to simplify their analyses. Examples include order 
parameters in physics P, Q , biomarkers in medicine 0- 

, indexes and asset prices in economics @, 0| , citation 
counts in bibliometrics [1, Q, and educational grades. 
To have descriptive and predictive povifcr, or serve as 
an optimization target, the variable should completely 
specify the current state and future dynamics of the pro- 
cess. Construction of such variables is challenging. Con- 
ventional dimensionality reduction techniques, such as 
principal component analysis, and their generalizations 
[lOl [Tl| often fail to produce such variables . The tech- 
niques focus on compact representation of an ensemble of 
configurations (the configuration space). The dynamical 
information contained in the temporal sequence of the 
configurations (trajectory) is ignored. 

If the future dynamics is completely specified by the 
current value of the variable and does not depend on 
history or the values of other variables, the dynamics is 
said to be Markovian. In this case the dynamics of the 
variable reproduces the original dynamics, i.e., the pro- 
jection preserves the dynamics. The stochastic dynamics 
of the variable can be simply described as diffusion on a 
free energy profile and is completely specified by the free 
energy profile and the diffusion coefficient. Conversely, 
diffusive dynamics implies Markovianity. 

Here I describe a method of constructing variables (the 
optimal reaction coordinate hereafter) which preserve the 
dynamics upon the projection. The method originated 
in the protein folding field, where the reaction coordi- 
nate and the associated free energy landscapes are used 
to described in a simplified yet accurate way the com- 
plex dynamics of protein folding (see inset of Fig. 1) 
Il3j . The optimal reaction coordinate is constructed 
based on the system dynamics. Given reaction coordi- 
nate time series X{iAt), the cut-based free energy pro- 
file (cFEP) can be constructed Fc{x)/kT = —lnZc{x), 



where partition function Zc{x) equals half the number 
of transitions performed by the reaction coordinate times 
series through point x [13, Ell (for details see Appendix). 
The cFEP is complementary and superior to the conven- 
tional histogram based free energy profile as it is invariant 
to reaction coordinate rescaling, insensitive to statistical 
noise and capable of detecting sub-diffusion. Together 
they determine the coordinate dependent diffusion coef- 
ficient D{x) and thus completely specify diffusive dynam- 
ics [lil . The optimal reaction coordinate is the one with 
the highest cFEP [ij, [3 with the following rationale. 
It generalizes the definition of the transition state as the 
minimum cut to any position on the reaction coordinate 
[l6j . Projection on a "bad" reaction coordinate results 
in smaller barriers and faster kinetics due to overlapping 
of different parts of the configuration space. The optimal 
coordinate exhibits the slowest kinetics [TJ] . The dynam- 
ics projected on this coordinate is closest to diffusive [11] . 
A putative functional form of the reaction coordinate is 
proposed based on a general understanding of the pro- 
cess. For example, a linear combination of "features" 
that could describe the process. For protein folding it 
could be a weighted sum of distances between atoms of 
a protein [l5| . The coordinate is optimized (trained) on 
a sample of trajectories representing realizations of the 
process. The coefficients of the functional form are nu- 
merically optimized to make the cFEP along the coordi- 
nate the highest. 

To emphasize the power and generality of the ap- 
proach the dynamics of the chess game is analyzed 
|17| . The chess is a model system of research in arti- 
ficial intelligence. Its complex dynamics is not gener- 
ated by a physical system and thus applicability of the 
free energy landscape framework is not evident. The 
ga mes played by the computer pro gram GNUCHESS 
(http://www.gnu.org/software/chess) against itself are 
analyzed here. No generality is lost, since computers 
surpassed humans at chess when the Deep Blue won the 
rematch with the World Chess Champion Garry Kas- 
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FIG. 1. (color online) Cartoon chess free energy land- 
scape. The game is described as a random walk (diffusion) 
on the free energy landscape (red line). Starting from the 
middle the game continues until either the right (white wins) 
or the left (black wins) end of the profile has been reached. 
Lower barrier for white reflects that white has more chances 
to win. The boards show representative positions for the re- 
gions on the landscape. The inset shows protein folding free 
energy landscape. 



parov (http://www.research.ibm.coni/deepblue). The 
idea that the chess game can be described by a random 
walk may seem rather extravagant (Fig. 1). To gain 
an advantage, players devise precise sequences of moves, 
which arejmything but random. However, in contrast to 
checker 18] the chess are not solved yet. One can not 
tell the result of a chess game starting from any position 
if played optimally. The result can only be guessed, sug- 
gesting applicability of stochastic description to the chess 
game, which is corroborated by results presented below. 

To keep the analysis one-dimensional only games that 
end in a victory are considered. 10000 games, initially 
preprocessed (see Appendix) are used for the analysis. 
As a putative reaction coordinate the evaluation function 
E{p), used in computer chess, is chosen. The function 
gives a quantitative estimation of the value of a position 
(p) and is a weighted sum of various factors (the major 
being the material factor) (l9| . For example, a pawn 
has a material value of 100 and a queen of 1100, so that 
E{p) = lOO(pu) -pb) -I- 1100(Qu, - Qb) + where p^,, Pb 
and Qu), Qb are the number of white and black pawns and 
queens on the board, respectively, and ... includes other 
factors describing more subtle properties of a position, 
such as, board control, mobility, pawn structure, passed 
pawns, etc. Note that alternative functional forms of 
reaction coordinate (e.g., the artificial neural networks) 
can be employed. 



REACTION COORDINATE OPTIMIZATION. 

The chess game has a non-equilibrium dynamics. The 
james proceed from the starting position to a checkmate 



and never backwards. In this case the dynamics is com- 
pletely specified by (x) and (x) which measure the 
fiux (number of transitions through point x) in the posi- 
tive and negative directions, respectively (Fig. 2a). Their 
difference manifests the non-equilibrium character of the 
dynamics and equals (for positive x) to the number of 
games won by white. The exponent a shows that dy- 
namics along E{p) is sub-diffusive a ~ 0.3. The expo- 
nent measures how the amplitude of the random jumps 
(changes of the position) scales with time Ax ^ At"'; 
for diffusive dynamics a = 0.5 and Ax ~ V At. The 
sub-diffusive dynamics indicates that the projected dy- 
namics is not Markovian (consequent displacements are 
anti-correlated) and that the putative reaction coordi- 
nate is not optimal, i.e., its value alone does not com- 
pletely specify the dynamics. 

It has been shown that sub-diffusive dynamics in pro- 
tein folding is observed when a sub-optimal reaction coor- 
dinate is used for description ^ISj . Indeed, the evaluation 
function E{p) employed in computer chess is a poor re- 
action coordinate. It can distinguish between positions 
where white (or black) has a clear advantage. Positions 
with more subtle advantage or at highly dynamic phases 
of the game (e.g., during a.n exchange of pieces) can not 
be accurately evaluated [1^ . In order to accurately evalu- 
ate a position, the computer performs an extensive search 
over all possible continuations of the position to a signif- 
icant extent and selects the one that maximizes the min- 
imum gam Such brute force number-crunching is in 
contrast with a human way of playing chess with creativ- 
ity and intuition [2d| . Here, instead, the coordinate is op- 
timized by making the Fc higher. The numerical param- 
eters oi E{p) (e.g., the queen's material value) were (iter- 
atively) randomly modified and kept if the modification 
resulted in a higher Fc (for details see Appendix). The 
cFEPs for the optimized coordinate E(j)) are marginally 
higher (by 0.3 around x=0. Fig. 2b) than that for the 
sub-optimal case (Fig. 2a). a ~ 0.5 indicates that the 
dynamics is diffusive and Markovian and that the op- 
timized reaction coordinate completely specifies the dy- 
namics, i.e., is the optimal reaction coordinate. 

The chess game and the protein folding [l^, both il- 
lustrate useful generic property of the cFEPs: the higher 
is the profile the more diffusive is the dynamics, i.e., the 
dynamics is not sub-diffusive per se. Consider two rea- 
sonably good reaction coordinates which differ locally but 
give similar large-scale description of dynamics. At suffi- 
ciently large times scale (Ai^), when memory effects due 
to sub-optimal projection [2l| can be neglected and dy- 
namics is Markovian, the coordinates have similar cFEP. 
The sub-diffusion exponent a can be estimated from the 
distance between the profiles computed with the large 
time step {At2) and the original (small) time step (A^i): 
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The higher is the cFEP (the smaller Zc) at the original 
time scale of Ati , the smaller is the distance between the 
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FIG. 2. (color online) Free energy profiles of the chess game, a) (dashed red), Z,^, (dotted blue) and a (solid black) 
for the sub-optimal E{p) reaction coordinate, b) Z^ (dashed red), Z^, (dotted blue) and a (solid black) for the optimal E{p) 
reaction coordinate, c) The chess game is described as diffusion on the (equilibrium) cFEP. The cFEP is computed with the 
free energy profile (solid red) and the Markov network (dashed blue) frameworks (from the cFEPs on panel b). E{p) and E{p) 
are transformed (rescaled) to X so that the diffusion coefficient D{x) equals to unity. 



profiles, the larger is a and more diffusive is the projected 
dynamics. 



the games of 0.59; AF/kT = F^/kT - F^/kT 
(Fig. [2fc). 



-0.4 



THE EQUILIBRIUM FEP. 

The Z^{x) and Zf^{x) (Fig. 2b) computed from the 
non-equilibrium trajectories do not represent the under- 
lying (equilibrium) FEP. The latter, however, can be 
computed from them (Fig. 2c) (see Appendix). In or- 
der to emphasize the robustness of the results, they are 
recomputed with the (complementary) Markov network 
formalism (see Appendix), which describes the dynam- 
ics by a network of transitions between different states 
[2^ . Though the profile computed with the Markov net- 
work shows some noise, the results obtained with both 
frameworks are in very good agreement. 

The chess game (as a whole) is described in a simple, 
while accurate way as diffusion (with D{x) = 1) on the 
profile. A game represents a particular realization of the 
stochastic diffusion process. It starts at x = and con- 
tinues until either the right (white wins) or the left (black 
wins) end of the profile has been reached. The relatively 
flat region of the profile (|a;| < 2.5) suggests that, initially, 
a trajectory (game) may switch many times between pos- 
itive and negative parts. It describes the constructive, 
search phase of the game, where the opponents are try- 
ing to get an advantage. After the barriers (|x| > 2.5) 
the profile is much steeper, making a return to the op- 
posite part very unlikely, meaning that the barriers are 
the rate limiting step in the game dynamics. As soon 
as a decisive advantage has been gained (a barrier has 
been overcame) the game strategy becomes much sim- 
pler: it is sufficient to exchange the pieces, while keeping 
the advantage. The probability of overcoming a barrier 
can be roughly estimated as half of the probability of be- 
ing at the top of the barrier pi ^ 0.5 exp{—Fi/kT), where 
Fi is the barrier height [2^ . The winning probability (of 
white) P = p^/ip^+Pb) = e-^^/^-^/(l+e-^^/^^) ~ 0.6 
is in agreement with the number computed directly from 



THE PROBABILITY TO WIN AND GAME 
ANALYSIS. 

A more stringent test is to compare the winning proba- 
bility P{x) for any position a;, which is the probability to 
reach the right end of the profile, before reaching the left 
end. It is analogous to the folding probability or com- 
mitor used in protein folding studies [U, [2^. Fig. 
shows P{x) calculated directly from the trajectory, from 
the diffusive dynamics on the cFEP and with the Markov 
network framework (for details see Appendix) . They are 
in a very good agreement. P{x) computed with the sub- 
optimal coordinate (Fig. 2a) notably disagree (Fig. 3b) 
because dynamics along this coordinate is neither dif- 
fusive nor Markovian. Naive approach to estimate the 
winning probability by collecting statistics for every po- 
sition is impractical due to the sheer size of the configu- 
ration space (estimated as lO"*"^ [l^ ). Markovian coarse- 
graining of the space is essential, and is obtained here by 
the projection onto the optimal coordinate. Once con- 
structed, the optimal reaction coordinate can be used to 
analyze a chess game in simple terms by showing the evo- 
lution of the winning probability. As an example, game 
between Garry Kasparov and Veselin Topalov played in 
Hoogovens in 1999 is considered (Fig. 3c). Initially, black 
equalizes the chances to win the game (at 10-th move). 
Then, white increases the chances to win up to P = 0.7 
(at 23-rd move). The part of the game between 24-th and 
38-th moves (dashed line) which starts with the rook sac- 
rifice can not be analyzed due to the shortcomings of the 
evaluation function. More sophisticated variants of the 
evaluation function should make possible the analysis of 
the entire game. 

The optimization procedure effectively tries to de- 
crease the size of each step, so that the chess dynamics 
along the optimal reaction coordinate consists of small 
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FIG. 3. (color online) The probability to win. a) The probability (for white) to win the game starting from any position 
computed along the optimal coordinate E{p): with diffusion on the cFEP (solid red) and with the Markov network framework 
(dashed blue), and directly from the played games (crosses). The inset shows the plot in logarithmic scale, b) that along the 
suboptimal reaction coordinate E(p). c) The analysis of the game between Garry Kasparov and Veselin Topalov: probability 
to win vs the move number. 



incremental changes (e.g., moves 1-23 on Fig. 3c). How- 
ever, the conclusion that a "brilliant" move, which alone 
can change the course of a game is impossible, is wrong. 
Since the game is described as a random process, there is 
non-zero Gaussian probability to have a large move along 
the reaction coordinate p(Ax) ~ e~^^ (the diffusion 
coefficient D = 1). 

CONCLUDING DISCUSSION. 

An optimization (learning) principle to simultaneously 
optimize the reaction coordinate and the playing strategy 
can be suggested. Z^{x) (Fig. 2b, blue) which counts 
the "retrograde" moves, is an indicator of the quality of 
play. The quality consistently improves with increasing x 
since the complexity of the game decreases. When moves 
are chosen according to the "perfect" winning strategy 
Z'^{x) = 0. Hence, Fc{x) attains the upper bound 
{Zq{x) = 0) for the best playing strategy and the op- 
timal coordinate describing it. The principle can be used 
to measure and optimize performance of stochastic al- 
gorithms, for example, to improve global optimization 
heuristics (26| . 

The presented analysis can be improved in the follow- 
ing ways: the development of more sophisticated variants 
of the evaluation function to treat the dynamic phases of 
the game; the construction of reaction coordinates tai- 
lored to different types of positions, or perhaps reaction 
coordinate which is iteratively updated during the game; 
two dimensional free energy landscape to allow analysis 
of games ending in a draw. 

Assuming that the chess game is a stochastic process 
the optimal coordinate that provides accurate description 
of the process has been constructed. The cut free energy 
profile is a function of the reaction coordinate time se- 
ries alone. Detailed specification of the dynamics (even 
the rules of the game) is not necessary to perform the 
optimization. It allows the approach to analyze phenom- 



ena which numerous characteristics can be monitored, 
while construction of the complete dynamical model is 
impractical. The analysis of the chess game shows that 
the approach can quantitatively describe the dynamics of 
processes where human decision-making plays a central 
role, e.g., financial and social dynamics. 

An interesting application is the construction of dis- 
ease biomarkers Q , specifically for such difficult cases 
as cancer Q, ageing [l^ or psychiatric disorders An 
optimal biomarker and the associated free energy land- 
scape that give an accurate description of the disease 
dynamics can be constructed out of panel of monitored 
parameters e.g., metabolomic, proteomic and genomic 
data. Diffusive dynamics would indicate that the optimal 
biomarker gives complete (Markovian) description, and 
sub-diffusive that some essential information is missing. 
Unlike conventional approaches, the proposed approach 
explicitly treats the dynamical character of the process. 
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APPENDIX 

The cut free energy profiles. Given reaction coor- 
dinate {R{X)) and an ensemble of trajectories Xi{j/S.t) 
sampled with interval Ai the ensemble of reaction coor- 
dinate trajectories is defined as Xi{i/S.t) — R{Xi{jAt)). 
The partition function of the conventional (histogram- 
based) free energy profile is estimated as 

ZHix)^N,/Ax, (1) 

where is the number of trajectory points in bin x and 
Ax is the size of the bin. Tlie partition function of the 
cut based free energy profile [IJ] at point x is estimated 
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as half the number of transitions through that point, i.e., 
Zc{x) = 1/2^ e{(a:,(jAt) - x){x - x,{jAt + At)^ 

where Q{x} is the Heaviside step function. 

For Gaussian distribution of steps P[Ax) ^ 
exp(-Aa;V(4I>At)) and assuming that Fh{x) is approx- 
imately constant on the distance of the mean absolute 
displacement (|Aa;(At)|), one obtains |14| 



Zc{x) = ZH{x)^D[x)At/n- 



(3) 



For non-Gaussian distribution of steps, observed here, 
assuming (Aa;^) — 2D At, Eq. [3] is corrected by factor v 



Zc{x) = vZH{x)^D{x)At/Ti 
v= {\Ax\)/^2{Ax-')/^ 



(4) 
(5) 



The correction factor v is assumed to be coordinate inde- 
pendent and is computed from the entire trajectory. The 
distribution of steps is expected to converge to Gaussian 
for more sophisticated evaluation functions, where each 
move changes a significant number of terms of the func- 
tion. 

When dynamics is not equilibrium the net trajectory 
flux is not zero and the detailed balance is not satis- 
fied. In this case positive and negative cut profiles that 
measure the flow in the corresponding direction are in- 
troduced 

Z+{x) ^^Q{x~ x^{jAt))Q{xi{jAt + At) - x) (6) 

Zci^) = - - x^{]At + At)). (7) 

Diffusion with constant flux J and gradient 
F(x) — ax. The flux for the steady state solution Pst{x) 
of the Smoluchowski equation is 



J 



where /3 — 1/kT and D{x) = D. Steady state distribu- 
tion is found as 



Pstix) = Zh{x) 



J 



D(3a 



Ce 



where C is a constant defined by a boundary condition. 
Distribution of displacements Ax during time step of At 
is 

/A A N 1 r {Ax + Daf3At)^^ 

p{Ax, At) = cxp[ ^ ^ ^ 1 



V47rDA< 



ADAt 



, the number of transition from y < Q to y > Q equals 

to 



= I dyZniy) / p{x,At)dx. 



Zfj , the number of transition from j/ > to y < equals 
to 

= dyZniy) / p{x,At)dx. 

Jo J -QC 

Integrating over y one obtains 



dxp{x, At){- 



Jx „ 1 - e^" 



C- 



D(3a /3a 



The net flow equals to 



dxp{x, At) 



Jx 
D(3a 



JAt 



Zc = {Z^ + Z(j)/2 is found by expanding the exponents 



Zc = ZHVDAt/^il + OiDAtp^a^)) 
The equilibrium FEP. F{x) can be found as 

J{x)dx 



F{x)/kT = - In Pst{x) 



D{x)Pst{x)' 



Using Pstix) = Zh{x), J{x)At = Z+{x) - Z^{x) and 
D{x) = [Zc/ZHfv-'^TTlAt, one obtains 

Fix)/kT^-XnZuix)- r^-(^^W-5cW)^.Wfe 



nZlix) 

The natural coordinate. A reaction coordinate (x) 
with variable diffusion coefficient can be transformed to 
the natural coordinate (y) with the diffusion coefficient 
equals to unity by numerically integrating [14] 



dy = u ^ \J At/'KZh{x)/Zc{x)dx. 



(9) 



In this case the diffusive dynamics is completely specified 
by the free energy profile only. 

The sub-difFusion exponent. In case of sub- 
diffusive dynamics, the mean absolute displacement 
scales with time as (|Ax(At)|) ^ At", where a < 0.5. 
The coordinate dependent exponent a is determined b; 
comparing Zc{x) at two different sampling intervals 



L by 

m 



.^ , , lnZc(x,Ati)~lnZc(x,At2) 

a(x) = 1 H — — . (10) 

^ ' In Ail - In Atz 

Reaction coordinate optimization. The optimal 
reaction coordinate is defined as the one that has cut 
based free energy profile Fc{x) highest for every value 
{x) of the coordinate |14l4la |. It generalizes the defini- 
tion of the transition state as the minimum cut to any 
position on the reaction coordinate jT6| . Projection on 
a "bad" reaction coordinate results in smaller barriers 
and faster kinetics due to overlapping of different parts 
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of the configuration space. The optimal coordinate ex- 
hibits the slowest kinetics The dynamics projected 
on this coordinate is closest to diffusive [3] . 

The coordinate as a whole is optimized by numerically 
maximizing J Z^^{x)Z}i{x)dx. For a flexible form of the 
reaction coordinate where Zc{x) for all x can be con- 
sidered as independent the functional attains the maxi- 
mum when all Zc{x) are minimal (all Fc{x) are maxi- 
mal). For a less flexible reaction coordinate the optimum 
may be a compromise solution where different parts of 
the coordinate are optimized to a different degree. An- 
other optimization functional max J Z^^{x)ZH{x)dx ^ 
J gF(x)/fcT/£,(2;)da;, which assigns more weight to the 
higher part of the profile, gives very similar results. For 
the case of over-damped Langevin dynamics 



attains the maximum when p is the committor function 
{p'^"™'^ {X)) or the folding probability, an alternative def- 
inition of the optimal reaction coordinate [l^ . 

The reaction coordinate is a weighted sum of different 
components of the evaluation function E{p) = ^ aifi{p), 
where are the weights, fiij)) are the components of the 
evaluation function and p denotes the position. Start- 
ing with the initial set of the weights a° (that of the 
GNUCHESS program) the coordinate is iteratively im- 
proved by randomly changing the weights and accept- 
ing the change if the value of the functional is increased. 

The cFEP is invariant with respect to local changes 
of scale of the evaluation function (gauge invariant), it 
depends only on the relative order of the points. This 
makes the construction of the optimal coordinate much 
easier, since a putative reaction coordinate E{p) should 
only reproduce the relative order, not the absolute value 
of the goodness of the positions. 

The Markov network framework. The Markov 
network describing the dynamics is constructed by parti- 
tioning the reaction coordinate into bins of size 0.005 
and computing the transition probabilities as pij 



j lu^j, where is the number of transitions from 
bin J to bin i. The reaction coordinate was converted 
to the natural coordinate to ensure uniform partition. 
The equilibrium FEP was constructed by computing the 
equilibrium network n^J 



where p^ are the equilibrium populations 

en \ ^ I eq 



(11) 



(12) 



and the sub-network p'^^ is the strongly connected compo- 
nent of the network, i.e., there is a path in each direction 
between any two nodes of the sub-network. In particular, 
the terminal (checkmate) nodes are discarded. Zc and 



Zh are computed as 

Zcix) = l/2^<;e{(x(^) -x)ix- xm (13) 



Zh{x)= J2 



eq 



(14) 



where x{i) and x{j) are the positions of the clusters i and 
j, respectively. 

The probability to win (the committor or the fold- 
ing probability in the analysis of protein folding dy- 
namics) measures probability to reach point x = b be- 
fore reaching point x = a. For the diffusion on a one- 
dimensional free energy profile it can be estimated [1^ 
as 



^^-y'^/Dix)dx/ / e^(^)/^^/i^(a;)dx. 



which can be expressed as 

Znix)Z-^{x)dx 



P 



\x) 



Zh{x)Zc ^{x)dx 
It can be estimated from the Markov network as 



coram 
Pi 



3 



(15) 



(16) 



withp^°!g™ = 1 andp[i°'^™ = 0, where 1-0 and 0-1 are the 
nodes corresponding to the states where white or black 
has won, respectively. It can be estimated directly from 
the trajectories as p™™™ = nf / {nf + n\), where and 
n\ are the number of times trajectory visiting bin i ends 
in white's or black's victory, respectively [30l|. 

Preprocessing of the chess game trajectories. 
The ensemble of the chess game trajectories consists of 
10000 games played by the GNUCHESS program against 
itself with default parameters. The positions during 
highly dynamic phases of the game (e.g., during exchange 
of pieces) can not be accurately evaluated with the em- 
ployed evaluation function [l^. To alleviate these prob- 
lems the trajectories were processed as follows. Initially, 
every continuous sequence of exchanges of pieces is cut 
off from the game trajectories. The trajectories were 
projected to the evaluation function coordinate by com- 
puting the evaluation function for every second position 
with white turn to move (every fourth ply; ply is a sin- 
gle move either by white or black). The trajectories then 
were transformed (normalized) to the natural coordinate. 
To avoid highly dynamic phases of the game other than 
the continuing sequence of exchanges of pieces, the steps 
with change in the value of the reaction coordinate larger 
than the threshold value of 5 (along the natural coordi- 
nate) were discarded. The first two moves of every game 
were discarded, to remove the very early development 
phase of the game and keep the game dynamics homo- 
geneous. More sophisticated variants of the evaluation 
function should make these preprocessing steps unneces- 
sary. 
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